Categories

Versions

You are viewing the RapidMiner Studio documentation for version 10.0 - Check here for latest version

Extract Length (Text Processing)

Synopsis

Extracts the document's length and adds it to the document's meta data.

Description

This operator counts the number of characters in each token of the document and will add it to the meta data of the document. Therefore the key is used that might be specified as parameter. If the key already exists, it will be overwritten. Please keep in mind, that the meta data might be added as attribute after the processing of the documents, depending on the parameter of the Process Documents operator.

Input

  • document

    The document port.

Output

  • document

    The document port.

Parameters

  • metadata_keyThe number of characters will be added under this key. The key will become the name of the attribute after document processing. Range: